Concurrent Neural Networks for Speaker Recognition
نویسنده
چکیده
We propose a new recognition model called Concurrent Neural Networks (CNN), representing a winner-takes-all collection of neural networks. Each network of the system is trained individually to provide best results for one class only. We have applied the above model for the task of speaker recognition. We performed distinct speaker recognition experiments using three variants of basic components of the CNN system: the Multi-Layer Perceptron (MLP), the TimeDelay Neural Network (TDNN) and the Kohonen Self-Organizing Map (SOM). We have used two databases: a clean speech database called SPEECHDATA and a telephone database called TELEPHDATA. The experiments proved a significant increase of the recognition score using the proposed CNN model by comparation to the use of a single neural network for the whole speaker recognition task. The SOM best has performed in our experiments proving an increase of about 38% for SPEECHDATA as well as an increase of about 30% for TELEPHDATA.
منابع مشابه
شبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملطراحی یک مدل مبتنی بر شبکههای عصبی برای شناسایی و تجزیه و تحلیل الگوهای غیرطبیعی در نمودارهای کنترل فرآیند
Neural networks because of their abilities are used to patterns recognition. In statistical process control charts, a common cause variation distort expected form of unnatural patterns and so detection of assignable causes efficiently and precisely in a real-time is difficult. Therefore it would be logical to propose models based neural networks for recognition and analysis of patterns in proce...
متن کاملA Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
متن کاملProbabilistic Neural Networks Combined with Gmms for Speaker Recognition over Telephone Channels
In this paper we study the applicability of Probabilistic Neural Networks (PNNs) as core classifiers to medium scale speaker recognition over fixed telephone networks. In particular, banking applications with up to 400 enrolled speakers and short training times are targeted. Two PNN-based open-set text-independent systems for Speaker Identification and Speaker Verification correspondingly are p...
متن کاملModeling speaker variability using long short-term memory networks for speech recognition
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area of research. Considering that long short-term memory (LSTM) recurrent neural networks (RNNs) have been successfully applied to many sequence prediction and sequence labeling tasks, we propose to use LSTM RNNs for modeling speaker variability in automatic speech recognition (ASR). Firstly, the LST...
متن کامل